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Abstract 

We present a deterministic ^ ^^^^ ^ -approximation algorithm for the s-t path TSP for an arbitrary 
metric. Given a symmetric metric cost on n vertices including two prespecified endpoints, the problem 
is to find a shortest Hamiltonian path between the two endpoints; Hoogeveen showed that the natural 
variant of Christofides' algorithm is a 5/3-approximation algorithm for this problem, and this asymp- 
totically tight bound in fact has been the best approximation ratio known until now. We modify this 
algorithm so that it chooses the initial spanning tree based on an optimal solution to the Held-Karp 
relaxation rather than a minimum spanning tree; we prove this simple but crucial modification leads 
to an improved approximation ratio, surpassing the 20-year-old barrier set by the natural Christofides' 
algorithm variant. Our algorithm also proves an upper bound of on the integrality gap of the path- 
variant Held-Karp relaxation. The techniques devised in this paper can be applied to other optimization 
problems as well: these applications include improved approximation algorithms and improved LP 
integrality gap upper bounds for the prize-collecting s-t path problem and the unit-weight graphical 
metric s-t path TSP. 
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1 Introduction 



After 35 years, Christofides' 3/2-approximation algorithm HI still provides the best performance guarantee 
known for the metric traveling salesman problem (TSP), and improving upon this bound is a fundamental 
open question in combinatorial optimization. For the path variant of the metric TSP in which the aim 
is to find a shortest Hamiltonian path between given endpoints s and t, Hoogeveen (TT] showed that the 
natural variant of Christofides' algorithm yields an approximation ratio of 5/3 that is asymptotically tight, 
and this has been the best approximation algorithm known for this s-t path variant for the past 20 years. 
Recently, there has been progress for the special case of metrics derived as shortest paths in unit-weight 
(undirected) graphs: Oveis Gharan, Saberi, and Singh flOl gave a (3/2 — eo) -approximation algorithm for 
the TSP, which can be extended to yield an analogous result of a (5/3 — ei)-approximation algorithm for 
the s-t path TSP in the same special case (see Appendix lAl). Momke and Svensson ifTSll gave a 1.4605- 
approximation algorithm for the same special case of the TSP, as well as a 1.5858-approximation algorithm 
for the s-t path TSP in the same case (where the results of Appendix |A] and Momke & Svensson fTSl were 
obtained independently). We note the techniques devised in these results for the unit-weight graphical 
metric case proved useful in both path and ordinary (circuit) variants. The main result of this paper is 
to provide the first improvement for the general metric case of the s-t path TSP: more specifically, we 
give a deterministic ^i^i^^ -approximation algorithm for the metric s-t path TSP for an arbitrary metric, 

breaking the 5/3 barrier. It remains an open question whether these techniques can be extended to yield a 
comparable improvement (over the 3/2 barrier) for the general-metric ordinary (circuit) TSP. 

Our analysis gives the first constant upper bound on the integrality gap of the path-variant Held-Karp 
relaxation as well, showing it to be at most the golden ratio, ^-^y-^- We will also demonstrate how the 
techniques devised in the present paper can be applied to other problems, such as the prize-collecting s-t 
path problem and the unit-weight graphical metric s-t path TSP, to obtain better approximation ratios and 
better LP integrality gap upper bounds than the current best known. 

Proposed by Held and Karp |[T6l originally for the circuit problem, the Held-Karp relaxation lfT6l is 
a standard LP relaxation to (the variants of) TSP, and has been successfully used by many algorithms 
mull 131 121 |20l[l8l[U]. In the LP-based design of an approximation algorithm, one important measure of 
the strength of a particular LP relaxation is its integrality gap, i.e., the worst-case ratio between the integral 
and fractional optimal values; however, there exists a significant gap between currently known lower and 
upper bounds on the integrality gap of the Held-Karp relaxation. For the circuit case, the best upper bound 
known of 3/2 is constructively proven by the analyses of Christofides' algorithm due to Wolsey [22] and 
Shmoys & Williamson [21]; yet, the best lower bound known is 4/3, achieved by the family of graphs 
depicted in Figure [H^a) under the unit- weight graphical metric ifTTIl . For the path problem, Hoogeveen IITtII 
shows the natural variant of Christofides' algorithm is a 5/3-approximation algorithm, but the analysis 
compares the output solution value to the optimal (integral) solution; therefore it is unclear whether the 
algorithm yields an integrality gap bound for the Held-Karp relaxation formulated for the path problem. 
The analysis of the present algorithm, in contrast, reveals an upper bound of on its integrality gap, 
matching the approximation ratio; we also show in Appendix [Al that an LP-based analysis of Christofides' 
algorithm proves a weaker upper bound of 5/3. We observe that the family of graphs in Figure Wib) 
establishes the integrality gap lower bound of 3/2 under the unit- weight graphical metric. Note that this 
lower bound is strictly greater than the known upper bound of (3/2 — eo) on the integrality gap of the 
circuit- variant Held-Karp relaxation under the unit- weight graphical metric; this suggests that the lack of 
a performance guarantee known for the s-t path TSP matching the 3/2 for other TSP variants has a true 
structural cause. 
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Figure 1 : Examples establishing the integrality gap lower bounds for the circuit- and path-variant Held- 
Karp relaxations. 




Figure 2: Example showing 5/3 is asymptotically tight [17]: a minimum spanning tree is marked with 
thick edges. 

A feasible solution to the path-variant Held-Karp relaxation is in the spanning tree polytope; thus, given 
a feasible Held-Karp solution, there exists a probability distribution over spanning trees whose marginal 
edge probabilities coincide with the Held-Karp solution. The present algorithm first computes an optimal 
solution to the Held-Karp relaxation, and samples a spanning tree from a probability distribution whose 
marginal is given by the Held-Karp solution. Then it augments this tree with a minimum T-join, where 
T is the set of vertices with "wrong" parity of degree, to obtain an Eulerian path visiting every vertex; 
this Eulerian path can be shortcut into an s-t Hamiltonian path of no greater cost. Our analysis of this 
algorithm shows that the expected cost of the Eulerian path is at most times the Held-Karp optimum; 
the analysis relies only on the marginal probabilities, and therefore holds for any arbitrary distribution with 
the given marginals. We note that this flexibility enables a simple derandomization: a feasible Held-Karp 
solution can be efficiently decomposed into a convex combination of polynomially many spanning trees 
(see Grotschel, Lovasz, and Schrijver |15 |) and trying every spanning tree in this convex combination 
yields a simple deterministic algorithm. We also note that our algorithm differs from Christofides' in 
only one crucial respect: rather than taking a single tree and augmenting it with a T-join, we try out 
polynomially many trees and then take the one whose augmentation yields the lowest-cost path. The 
example in Figure |2]due to Hoogeveen 1 17 1 shows that this simple modification of the original algorithm is 
crucial to achieving the improved approximation ratio: if one only tries augmenting the minimum spanning 
tree, the approximation ratio remains no better than 5/3. 

As the expected cost of the sampled spanning tree is equal to the Held-Karp optimum, the rest of the 
analysis focuses on bounding the cost of the minimum T-join by providing a low -cost fractional T-join 
dominator that serves as an upper bound on the cost of the minimum T-join. First we show that the 
Held-Karp solution and the spanning tree, while being costly fractional T-join dominators themselves, are 
complementary: a certain linear combination of them is a fractional T-join dominator whose expected 
cost is no greater than 2/3 times the Held-Karp optimum, thereby recovering the same 5/3 performance 
guarantee provided by Hoogeveen's analysis of Christofides' algorithm. Based on this beginning analysis, 
we present progressively better ways of constructing a low-cost fractional T-join dominator. In all of 
these approaches, we perturb the coefficients of the tree and the Held-Karp solution to reduce the cost of 
their linear combination, at the expense of potentially violating some constraints of the fractional T-join 
dominator linear program, and then we add a low-cost correction to repair the violated constraints. To 
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construct this correction vector and to bound its cost, we show that the only potentially violated constraints 
correspond to narrow cuts having a layered structure, as illustrated in Figure[3] The layered structure allows 
us to choose disjoint sets of representative edges for each cut and to correct the violated constraints using 
a sum of vectors each supported on the representative edge set of the corresponding narrow cut. We show 
that this idea leads to a slight improvement upon 5/3, using the fact that the representative edge sets, while 
being mutually disjoint, occupy a large portion of each cut and that each narrow cut constraint has only a 
small probability of being violated. After that, we present a tighter analysis with a similar construction. 
Finally, pushing the performance guarantee towards the golden ratio requires relaxing the disjointness of 
the representatives to a notion of "fractional disjointness". We define this relaxed disjointness, construct 
the requisite fractionally disjoint vectors via the analysis of an auxiliary flow network, and prove the 
performance guarantee of ^^-/^ ■ We note that neither the fractional T-join dominator nor the narrow cuts 
are actually computed by the algorithm: these progressive analyses all analyze the same single algorithm 
while different fractional T-join dominators are considered in each analysis. That is, it might be possible 
to obtain a better performance guarantee for the same algorithm by providing a better construction of a 
fractional T-join dominator. The narrow cuts are purely for the purpose of analysis in Section [3] and never 
determined by the algorithm; however, their algorithmic use is explored in Section HI 

Section |4] demonstrates how the present results can be applied to other problems to obtain better ap- 
proximation algorithms than the current best known. We first consider the metric prize-collecting s-t path 
problem. In a prize-collecting problem, we are given "prize" values defined on vertices, and the objective 
function becomes the sum of the "regular" solution cost and the total "missed" prize of the vertices that 
are not included in the solution. For example, the prize-collecting s-t path problem finds a (not necessarily 
spanning) s-t path that minimizes the sum of the path cost and the total prize of the vertices not on the path. 
Chaudhuri, Godfrey, Rao, and Talwar [7| give a primal-dual 2-approximation algorithm for this problem. 
Prize-collecting TSP, the circuit version of this problem, has been introduced in Balas [5 |; Bienstock, Goe- 
mans, Simchi-Levi, and Williamson |f6| give aLP-rounding 2.5-approximation algorithm, and Goemans & 
Williamson lfT4l show a primal-dual 2-approximation algorithm. For both problems. Archer, Bateni, Ha- 
jiaghayi, and Karloff f3\ give improvement on approximation ratios: using the path-variant Christofides' 
algorithm as a black box. Archer et al. give a 241/121-approximation algorithm for the prize-collecting s-t 
path problem; a 97/49-approximation algorithm is given for the prize-collecting TSP, using Christofides' 
algorithm as a black box again. For the prize-collecting (circuit) TSP, Goemans lfT2l combines Bienstock 
et al. 161 and Goemans & Williamson (14] to obtain a 1.9146-approximation algorithm, the current best 
known. 

As the analysis of Archer et al. |[3l treats Christofides' algorithm as a black box, replacing this with the 
present algorithm readily gives an improvement. Furthermore, we will show that, since the present analysis 
produces the performance guarantee in terms of the Held-Karp optimum, Goemans' analysis |[T2l can be 
extended to the prize-collecting s-t path problem. One obstacle is that the parsimonious property |fT3l used 
in Bienstock et al. does not immediately apply to the path case; however, we prove that a modification to 
the graph and the Held-Karp solution allows us to utilize this property. This yields a 1.9535-approximation 
algorithm for the prize-collecting s-t path problem; the same upper bound is established on the integrality 
gap of the LP relaxation used. 

Secondly, we study the unit-weight graphical metric s-t path TSP to present a 1.5780-approximation 
algorithm. As discussed above, there has been progress for this special case in both the ordinary (circuit) 
TSP and the s-t path TSP. In Appendix IaI we show how the results of Oveis Gharan et al. f20l extend to the 
path case. Most recently, Mucha 1 19 1 gave an improved analysis of Momke & Svensson's algorithm ifTSl 
to prove the performance guarantee of 13/9 for the circuit case and 19/12 + e for the path case, for any 
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e > 0. We observe that the critical case of this analysis is when the Held-Karp optimum is small, and 
we show how to obtain an algorithm that yields a better performance guarantee on this critical case, based 
on the main results of this paper. In particular, we devise an algorithm that works on narrow cuts to be 
run in parallel with the present algorithm; this illustrates that the narrow cuts are a useful algorithmic tool 
as well, not only an analytic tool. Our algorithm establishes an upper bound on the integrality gap of 
the path-variant Held-Karp relaxation under the unit-weight graphical metric, which does not match the 
performance guarantee but smaller than ^^^^ ■ 



2 Preliminaries 

In this section, we introduce some definitions and notation to be used throughout this paper. 

Let G = {V,E) be the input complete graph with metric cost function c : E ^ M_|_. Endpoints 
s,t & V art given as a part of the input; we call the other vertices internal points. 

For A,B C V such that Af^ B = ^, E{A, B) denotes the set of edges between A and B: i.e., 
E{A, B) = {{u, v} G E\u e A,v e B}. Let E{A) denote the set of edges within A: E[A) := {{u, v} G 
E\u,v G A}. 

For nonempty U V, let {U, U) denote the cut defined by U, and 5{U) be the edge set in the cut: 
5{U) = E{U, U). {U, U) is called an s-t cut if \U Ci {s,t}\ = 1; we call {U, tj) nonseparating otherwise. 

For x,c G and F C E, x{E) is a shorthand for Ylifi^F^f'^ '^(^) YleeE'^f^^e- The incidence 
vector xf £ of F C -E is a (0, l)-vector defined as follows: 



(X 



F e 



1 if e G F, 
otherwise. 



For two vectors a, 6 G M^, let a * 6 G denote the vector defined as: 

(a * b)i := aibi. 

Definition 1 ( 1. 16 J ). The path-variant Held-Karp relaxation is defined as follows: 
minimize c{x) 

subject to x{5{S))>l, VSC y,|{s,t}n5| = 1; 

x{5{S))>2, VScy,|{s,t}n5| /l,5/0; 

x{5{{s})) = x{5{{t})) = I- 

x{5{{v]))=2, V^;Gy\{s,t}; 

X > 0. 

This linear program can be solved in polynomial time via the ellipsoid method using a min-cut algo- 
rithm to solve the separation problem |[T5l . The following observation gives an equivalent formulation of 

o. 

Observation 1. Following is an equivalent formulation of 
minimize c{x) 

subject to x{E{S)) <\S\-1, V5 C {s, t} ^ 5, 5 / 0; 

x{E{S)) < \S\-2, V5 c y, {s, 4 ^ 5; 

x{5{{s})) = x{6{{t})) = 1; 

x(5(M))=2, yveV\{s,t}; 

x>0. 
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Definition 2. For T C V and J C E, J is a T-join if the set of odd-degree vertices in G' = {V, J) is T. 

Edmonds and Johnson lITOl give a polyhedral characterization of T-joins: let Pt{G) be the convex hull 
of the incidence vectors of the T-joins on G = {V, E)\ Pt{G) + R;^ is exactly characterized by 

{y{5{S))>l, V5cy,|5nr|odd; 

\y e Mf . 

We call a feasible solution to ^ Si fractional T-join dominator. 

Lastly, the polytope defined by the path- variant Held-Karp relaxation is contained in the spanning tree 
polytope of the same graph; thus, given a feasible solution x* to the path-variant Held-Karp relaxation, 
there exist spanning trees . . . , 5^ and Ai, . . . , G M+ such that x* = Yli=i and J2i=i — 

where k is bounded by a polynomial. This follows from Grotschel, Lovasz, and Schrijver |[T5l . 

3 Improving upon 5/3 

We present the algorithm for the metric s-t path TSP and its analysis in this section. 

3.1 Algorithm 

Given a complete graph G = {V,E) with cost function c : E ^ and the endpoints s,t G V, 
the algorithm first computes an optimal solution x* to the path-variant Held-Karp relaxation. Then it 
decomposes x* into a convex combination ^ Kxs^i of polynomially many spanning trees . . . , 
with coefficients Ai, . . . , A^ > 0; a spanning tree is sampled among these spanning trees ^i?, from the 
probability distribution given by Aj's. This decomposition can be performed in polynomial time, as noted 
in Section|2] Let T C ^ be the set of the vertices with the "wrong" parity of degree in i.e., T is the set 
of odd-degree internal points and even-degree endpoints in ^ . The algorithm finds a minimum T-join J 
and an s-t Eulerian path of the multigraph ,5/" \J J. This Eulerian path is shortcut to obtain a Hamiltonian 
path H between s and t; H is the output of the algorithm. 

We note that this algorithm can be derandomized by trying each ^ instead of sampling Observe 
that E[c{H)] < pc{x*) implies that the derandomized algorithm is a deterministic p-approximation algo- 
rithm. 

In the rest of this section, we prove the following theorem. 

Theorem 1. The present algorithm returns a Hamiltonian path between s and t whose expected cost is 
no more than ^'^^ c{x*). Therefore, there exists a deterministic ^ ^^2^ ) -(approximation algorithm for the 
s-t path TSP. 

Corollary 1. The integrality gap of the path-variant Held-Karp relaxation is at most 

3.2 Proof of 5/ 3-approximation 

In this subsection, we present a simple proof that the present algorithm is a (expected) 5 / 3-approximation 
algorithm. Improved analyses are presented in later subsections based on this simple proof. 

We can understand the well-known 2-approximation algorithm for the circuit TSP and Christofides' 
3/2-approximation algorithm as respectively using the minimum spanning tree and half the Held-Karp 
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solution |[22l |2TI as a fractional T-join dominator. Let us consider whether x;7 ^rid x* can be used to 
bound the cost of a minimum T-join in our case. 

It can be seen from ([T]) that ^x* is a fractional T-join dominator for /3 = 1. If it were not for the s-t 
cuts, the same could be shown for /3 = ^. However, an s-t cut may have capacity as low as 1, making it 
hard to establish the feasibility of fix* for any /3 < 1. 

axsr also is a fractional T-join dominator for a = 1; in this case, however, s-t cuts do have some 
slack. Suppose that an s-t cut ([/, f7) is odd with respect to T: i.e., |[/ n T| is odd. Since U contains 
exactly one of s and t, U contains an even number of vertices that have odd degree in \S{U) n ^\ 
is given as the sum of the degrees of the vertices in U minus twice the number of edges within U, and is 
therefore even. This shows x.t{H^)) ^ ^ and hence ax.T for a = ^ does not violate (O as far as s-t cuts 
are concerned. It is the nonseparating cuts that render it difficult to show the feasibility of ax,^ for a < 1. 

Given the difficulties in these two cases are complementary, it is natural to consider ax,^ + Px* as a 
candidate for a fractional T-join dominator; Theorem |2] elaborates this observation. 

Theorem!. E[c{H)] < ^c{x*). 

Proof. Let y := axsr + for some parameters a, /? > to be chosen later. We examine a sufficient 
condition on a and /3 for y to be a feasible solution to (O. 
It is obvious that y > 0. 

Consider an odd cut ([/, f7) with respect to T: i.e., |C/ n T| is odd. We have n 5^1 > from 

the connectedness of .'^ . Suppose that (?7, C7) is an s, t-cut; then |(5(?7) H is even as previously argued. 
Thus, 

y{8{JJ)) = a|5(C/)n=^|+/3x*(5(C/)) 

> 2a + /3. 

Suppose that ([/, IJ) is nonseparating; then we have x*((5(?7)) > 2 from the Held-Karp feasibility, and 
hence 

y{K^)) > a\6{U)n£r\+i3x*{6{U)) 

> a + 2p. 

Therefore, if 2a + /? > 1 and a + 2/3 > 1 then y is feasible. Now we bound the expected cost of H: 

E[ciH)] = E[c(^)]+E[c(J)] 
< E[c(^)]+E[c(y)] 
= E[c(^)]+E[c(ax,^)]+E[c(/3:E*)] 
= (1 + a + /3)c(a;*), 

where the second line holds since y is a fractional T-join dominator. Choose a = I and /? = 4. □ 



3.3 First improvement upon 5/3 

We demonstrate in this subsection that the above analysis can be slightly improved. 

Recall that the lower bound on the nonseparating cut capacities of y was given as a + 2/3 in the previous 
analysis; consider perturbing a and /3 by small amount while maintaining a + 2/3 = 1. In particular, if 
we decrease a by 2e and increase /3 by e, we decrease the expected cost of y by ec(x*), without changing 
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a + 2/3; that is, if we can fix the possible deficiencies of y in s-t cuts with small cost, this perturbation will 
lead to an improvement in the performance guarantee. 

Note that s-t cuts with large capacities are not a problem: iaxsr + fix*){b{\J)) > 2a + /3x*((5(C/)) 
and thus, if x*(5(C/)) is large enough, the bound remains greater than one after a small perturbation. 

On the other hand, cuts with x*{b(\J)) = 1 are also not a concern. = E[|5(C/) n and 

I <r\ ^\'>\ from the connectedness of hence H is identically 1 and |f/ n Tj is always 

even. Formulation ^ constrains the capacities of only the cuts that are odd with respect to T, so the 
capacity of this particular cut {U,U) will never be constrained. In fact, for an s-t cut {U,U), 

Pr[|;7nT| isodd] < Pv[\6{U) n ^\ > 2] 
< E[\6{U) n ^\] - 1 

= x*{6{U))-l. (3) 

We will begin with y ^ olxst + /3x* for perturbed a and /3, and ensure that y is a fractional T- 
join dominator by adding small fractions of the deficient odd s-t cuts. Yet, a cut being odd with small 
probability as shown by (O does not directly connect to its edge being added with small probability, since 
an edge belongs to many s-t cuts. We address this issue by showing that the s-t cuts of small capacities 
are "almost" disjoint. 

First, consider the s-t cuts (?7, [7) whose capacities are not large enough for 2a + /3x*((5(C/)) to be 
readily as large as 1; the following definition captures this idea. Let r := '^—^ — 1- 

Definition 3. For some < r < 1, an s-t cut ([/, [7) is called r-narrow if x*{6{U)) < 1 + r. 

The following lemma shows that r-narrow cuts do not cross. 

Lemma 1. Let < r < 1. For Ui B s and U2 B s, if both (C/i,C7i) and {U2,U2) are T-narrow, then 
Ui C U2 or U2 C Ui. 

Proof. Suppose not. Then both Ui \ U2 and U2 \ Ui are nonempty and 

x*{5{Ui))+x*{5{U2))>x*{5{Ui\U2))+x*{5{U2\Ui)) > 2 + 2 = 4; (4) 
on the other hand, 

x*{6{Ui)) + x*{6{U2)) < 2 + 2r < 4, 
contradicting (|4]|. □ 

Lemma [J shows that the r-narrow cuts constitute a layered structure, as illustrated in Figure [3l 

Corollary 2. There exists a partition Li,...,L£ofV such that 

1. Li = {s}, Li = {t}, and 

2. {U\{U, U) is T-narrow, s eU} = {Ui\l < i < £}, where Ui := ^{^iLk- 
Let L<j denote U^^^Lfc and L>j denote U^^^L^. Ui = L<i. 

Now we show that r-narrow cuts are almost disjoint: for each r-narrow cut (C/j, Ui), we can choose 
Fi C 6{Ui) that occupies a large portion of 6{Ui) and mutually disjoint. 

Definition 4. Fi := E'(Lj, L>j+i). 



7 




Figure 3: 0.05-narrow cuts of a feasible Held-Karp solution {£ = 6). F3 is marked with thick edges. 

Lemma 2. For each r-narrow cut {Ui, Ui), x*{Fi) > > 1 - i 

Proof. The lemma holds trivially for i = 1. Suppose 2 < i < £ — 1. We have 

1 + T > X* i5{U^-l)) = X* {E{L<i.i, Li)) + X* {E{L<^.i , L>i+i)) (5) 

and 

x*{6{Ui)) = x*{E{L„L>,+i))+x*{E{L<i.i,L>i+i)). (6) 

From and ©, 

x*{6{Ui)) - 1 - T < x*(E(Li,L>,+i)) -x*(i^(L<,_i,Li)); 

on the other hand, 

2 < X* {6{Li)) = X* {E{L„L>,+i)) + x* {E{L<i_u U))- 

Thus, 



, l-r + x*(,5([70) ^ ^ r 
X (Fi) = X (F(Li,L>i+i)) > ^ 2' 



□ 



It is obvious that Fj's are disjoint and Fj C 5{Ui). For each r-narrow cut f/j, we define /^^ as 

(/f/Je : 



otherwise. 



Theorems. E[c(i7)] < 1.6577c(x*). 
Proof. Let 

^ « * ^ V- 1 - (2a + ^) 

i:\Uir{T\ is odd, \<i<i 2 

for a = 0.30, /3 = 0.35 and r = ^-^^ — 1 = We claim y is a fractional T-join dominator. It is obvious 
that y > 0, and we have argued that y{6{U)) > 1 for nonseparating {U, U). Suppose {U, U) is an s-t cut 
with \ U riT\ odd. If {U, U) is not r-narrow, then 

y{6{U)) > a\6{U)n^\+(5x*{S{U)) 
> 2a + /3(l + r) 
= 1. 
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If ([/, U) is r-narrow, then 



yisrn > a\6{U)n£r\+i3^*{5{U)) + ^-^^±^fmu)) 
= 1. 

Thus y is a fractional T-join dominator. Now it remains to bound the expected cost of H. Let A :- 

1 - (2a + 13) 



1 



E[cm = E[c(=^)]+E[c(J)] 



E[c{jr)] + E[c{ax.T)] + E[c(/3x*)] + E 
/i-i 

(1 + a + (3)c{x*) + c J]]Pr[|C/i n T\ is odd] -A-flj^ 



i:|(7inT| is odd, l<i<£ 



From (O, 



E[c(/7)] < (l + a + /3)c(x*) + TAc(^^/^^j 
< {l + a + P + TA)c{x*), 
where the last line follows from the disjointness of Fj. Note that 1 + a + (3 + tA < 1.6577. □ 

3.4 A tighter analysis 

In the previous analysis, we separately bounded the probability that a r-narrow cut is odd, the deficit of the 
cut, and moreover, we used 1 — § instead of from Lemma|2l These observations 

lead to some improvement, as shown in the following theorem. 

Theorem 4. E[c{H)] < ^^^c{x*). 

l-T + X*id{Ui)) 



Proof. Let fej : 
Let 



denote the lower bound of {6{Ui)) given by Lemma|2l 



l-{2a + /3x*{5{Ui))} 



y ■■= ax.^ + /3x* + ^ 

i:\Uir\T\ is odd, l<i<l 

where a and (3 are to be chosen later; r := — 1. As in the previous subsection, {C/j} and {Li} denote 
the r-narrow cuts and their layered structure. Assume \ < P <\ and 1 — 2/3 < q < . 
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A similar argument as in Theorem [3]proves that y is a fractional T-join dominator; it can also be shown 

that 

E[cm < (1 + a + (3)c{x*) + c (j2 MU. n T\ is odd] ^ " + ^^'^('^(^0)} jA 



'l-l 



< (l + a + /3)c(x*) + 



^1=1 

l-{2Q + /3(l+a;)} 

max I uj 



-1 



C 



, 1 - {2a + /3(1 +w)} w[l - {2a + + w)}l 
Let := w i ^ = ^ , ^ , -^^ We have 

2"^2 2 2^"'"/3"'"2 

-1^2 + (1 _ 2a - 3/3)a; +(2-Aa-l^-^ + f-^ 

^'(^) = 7 72 

I - - — + - + - I 

1^2 2/3^/3^2^ 

and the unique solution to 

. s . 1 — 2a 
i?'(a;) = {0<uj< — 1) 



uj = ojo:=^(l-2a-3/3 + v/(-2/3)(l - 2a - 3/3)) . 

Since R{oj) > for < w < - 1 and R{0) = R{^^ - 1) = 0, R{u}) is maximized at w = ojq; 
hence, from ([7]l, 



E[c(H)] < 5a + 11/3 - 1 - 4V(-2/3)(l - 2a - 3/3) c(x*) 



Choose a = /3 = | — and we obtain 



mm < ^—^c{x*). 



□ 



3.5 Proof of ^ i^Y^ j -approximation 

In this final subsection, we show that E[c(iJ)] < i^c(x*), proving Theorem [T] and Corollary [T] 

In the previous analyses, Fj's serve as "representatives" of r-narrow cuts. These representatives are 
useful since they have large weights while being disjoint. We improve the performance guarantee by 
introducing a new set of representatives that are "fractionally disjoint". Note that the three key properties 
of {/{} } used in the proof of Theorem |4] are: 
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1. > for all i; 

2- Et!/^,<x*;and 

3. flr^{S{Ui)) > for all i. 

{flj.} chosen in the previous analyses also satisfies that, for any given e ^ E, {fij) ^ 7^ for at most one 
i. However, this was not a useful property in the analysis; Lemma [3] states that, by relaxing the definition 
of disjointness, we can choose {/j}.} that have larger weights. The definitions of r, {Ui} and {Li} are 
unchanged. 

Lemma 3. There exists a set of vectors {fij-YiZk ^<^tisjymg: 
1. f^^ G Rf for alii; 

2- Ei=i fli, ^ a;*; and 

3- f%{mi)) > I for alii. 

This lemma is proven later; based on it. Lemma [3] proves the desired performance guarantee. 
Lemma 4. E[c{H)] < i^c(x*). 
Proof. Let 

y := ax.^ + I3x* + ^ [1 - {2a + (3x*{6{Ui))}] fl,^, 

i:\UinT\ is odd, l<i<i 

where a and /? are parameters to be chosen later, satisfying 

-</?<- and l-2/3<a< (8) 
3 2 2 

By following the same argument as in Theorem \4\ we can easily show that y is a fractional T-join 
dominator; the only slight difference is when ([/, U) is r-narrow and |[/ n T| is odd, where we have 

ym)) > a\5iU)n^\+(3x*i6{U)) + [l-{2a + Px*i6iUi))}]flji6{U)) 
> 2a + Px*{5{U)) + [1 - {2a + Px*{5{Ui))]] ■ 1 
= 1, 

from the first and the third properties of Lemma[3l Hence, y is a fractional T-join dominator. 
Now it remains to bound E[c(ff)]. 

E[c(if)] < E[c(^)]+E[c(y)] 

= (1 + a + P)c{x*) + c Vv[\Ui n T\ is odd] [1 - {2a + Px*{5{Ui))]] 

< {l + a + fi)c{x*) + c (Y^{x*{5m) - 1} [1 - {2a + Px*{5{Um /^^^ 

< (l + Q + /3)c(x*) + | max^6.[l-{2a + /3(l + u;))}]|c|'^/^V (9) 
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Figure 4: A feasible Held-Karp solution (i = 3) and its corresponding flow network. 



From the second property of Lemma [3l 

Hc{H)] < \l + a + /3+ max uj[l- {2a + /3(1 + w))}] } c{x*] 
1 + a + /3 + max /3u!{t — w) > c{x*) 

0<UJ<T J 

^ (1-20-/3)2] , 
l + a + /3 + ^ i^-^r^"^^- 

We choose a = 1 - and /3 = □ 



Proof of Lemma^ Consider an auxiliary flow network illustrated in Figure HJ consisting of the source 
^source^ sink i;^'"*^, a node for each 
r-narrow cuts. The network has arcs of: 



^source^ sink v^^"^ , a node v"^^ for each r-narrow cut U , and a node for each edge e in one or more 



1. capacity 1 from ^1=°"''^^ to v'fj^ for every r-narrow cut U; 

2. capacity oo from v'^^ to v^^^ for every e G (^(t^), for all U\ 

3. capacity x* from wl*^^^ to w^'"'^ for every vf^^^ . 

Let g be this capacity function. 

Let (5, 5) be an arbitrary cut on this flow network, where v^°"'"^^ G S. We claim the cut capacity of 
(5, S) is at least £ - 1. 

Suppose there exists a r-narrow cut U and e G (^(t^) such that f G 5" and ul*^^^ ^ S; the cut capacity 
is then oo. So assume from now that (abusing the notation) every edge in any r-narrow cut in S is also in 
S. Let S n {u^f |1 <i<l] = {u^f , W(>f , • • • , } for some 1 < ii < ia < • • • < 4 < ^- The cut 
capacity is then at least 

= (£-l-fc)+ ^ x:; 

eiBjjg'teS ee<5(C/) 
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Figure 5: Schematic diagram: i = 6, k = 3, ii = 2, 12 = 3, and = 5. 



if /c = 0, the claim holds; the claim also holds for k 
Figure [5]l. 



1 since > 1. Suppose k > 2 (see 



E 



'^*m^.)) + ^x*i6iU,^ \ [/,^„J) + x*i6iV \ U,,)) 
i=2 



> -[l + 2(fc-l) + l] 
= k, 

proving the claim. 

Thus the maximum flow on this flow network is of value at least ' 
this flow saturates all the edges from v^°^'''^^ to v^'^, since the cut capacity of {1;^°"'''^^}) is £ — 1. 

Now, for each r-narrow cut U, define (/{})e as the flow from vf^^ to vt^^^ if e G S{U), and otherwise. 



1. Consider a maximum flow; 

source 1 



Then the first property is satisfied from the definition of flow; the second property is satisfied from the 

; lastly, the third property is satisfied since every edge from v^°"'^'^^ to 

□ 



capacity constraints on vt^^^ to v^'"^ 



v^^ is saturated 



4 Application to other problems 

In this section, we exhibit how the present results can be applied to other problems to obtain approximation 
algorithms with better performance guarantees than the current best known and improved LP integrality 
gap upper bounds. 



4.1 Prize-collecting s-t path problem 

We discuss the prize-collecting s-t path problem in this subsection. 

Definition 5 (Metric prize-collecting s-t path problem). Given a complete graph G = {V, E) with s,t ^V, 
metric edge cost function c : E ^ M_|_, and vertex prize it : V ^ M+, the metric prize-collecting s-t path 
problem is to find a simple s-t path P that minimizes the sum of the path cost and the total prize "missed ", 
i.e., c{P) + tt{V \ V{P)). 

The s-t path TSP can be considered as a special case of the prize-collecting s-t path problem, where 
Tr{v) = 00 for all v € V. 

Archer et al. [3] use the path-variant Christofides' algorithm ifTTl as a black box to obtain a Hj- 
approximation algorithm for the metric prize-collecting s-t path problem. ^ < 1.9918. 
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Theorem 5 (Archer et al. IS). Given a p-approximation algorithm si for the metric s-t path TSP, one can 
obtain a ^2 — (^fq:^^ ^-approximation algorithm for the metric prize-collecting s-t path problem that 
uses si as a black box. 

This theorem, combined with Theorem[T] readily produces an improvement. ^^"^ < 1.9889. 

Corollary 3. There exists a ^ ^^"^"^ ^ -approximation algorithm for the metric prize-collecting s-t path 
problem. 

However, as the performance guarantee established by Theorem [T] is in terms of the Held-Karp op- 
timum, the theorem enables a further improvement via an analysis analogous to Goemans (12). For the 
metric prize-collecting traveling salesman problem, Goemans [12J combines the LP rounding algorithm 
due to Bienstock et al. |6| and the primal-dual algorithm of Goemans & Williamson lfT4l (with the obser- 
vation of m and iS) to achieve the best performance guarantee known for the problem. 

One obstacle in applying this approach to the prize-collecting s-t path problem is that, unlike the 
circuit-variant Held-Karp relaxation, the path-variant Held-Karp relaxation cannot be written as a set of 
edge-connectivity requirements between the pairs of vertices: the relaxation requires nonseparating cuts to 
have capacity of at least 2, whereas the edge connectivity between any two vertices can be as low as 1 in 
both a feasible Held-Karp solution and a (integral) Hamiltonian path. We will show that, despite this fact, 
the parsimonious property [ 13] still can be used, and will analyze the performance guarantee given by the 
approach. 

We start with the following LP relaxation of the problem: 
minimize c(x) + 7r(l — y) 

subject to x{6{S))>l, ySCV,\Sn{s,t}\ = l; 

x{6{S))>2y^, V5C y,5n{s,t} = yv£S; 

x{6{{s})) = x{6{{t})) = 1; (10) 

x(5(W))=2y„, yveV\{s,t}; 

Xe > 0, Ve G E; 

0<yv<l, yv€V\{s,t}; 

where 1 denotes the all-1 vector in 1/ G M^^^"'*^. It can be easily verified that this is a relaxation of the 
prize-collecting s-t path problem. 

Given V' C V \ {s, t}, consider a related problem of finding a minimum s-t path on G that visits all 
the vertices in V, and only those vertices. The following LP is a relaxation to this problem: 

minimize c(x) 

subject to x{6{S))>l, VS C V,\S D {s,t}\ = 1; 

xld{S)) > 2, V5 C F, 5 n {s, i} = 0, 5 n y / 0; 

x{6i{s})) = xmt})) = i; (11) 

x{6{{v})) =2, Vz; G V; 

xmv}))=0, yveV\{s,t}\V'; 
Xe > 0, Ve G E. 

Observation 2. Let G' = {V' U {s,t},E') be the subgraph of G induced by V' U Projecting a 

feasible solution to dill ) to E' yields a feasible solution to the path-variant Held-Karp relaxation for G'. 
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The following lemma shows that we can use the parsimonious property. 



Lemma 5. The optimal solution value to (|11|) is equal to the optimal solution value to the following 
relaxation without the degree constraints: 



Proof. Let G = {V, E). It suffices to show that, given a feasible solution x* to (fT2l) . how to construct a 
feasible solution to (ITTI ) whose cost is no greater than c{x*). 

We will extend the graph (and x*) so that the relaxation (almost) becomes a set of edge-connectivity 
requirements between pairs of vertices, and then use a similar approach as in Bienstock et al. IH, along 
with the following lemma: 

Lemma 6 (||6l). Let G = {V, E) be an Eulerian multigraph. Suppose that, for some U C V and v £ V, 
any two vertices in U other than v are k-edge-connected. Let x be an arbitrary neighbor of v; then, there 
exists a neighbor yofv such that 

L X ^ y; and 

2. any two vertices in U other than v are still k-edge-connected after splitting (x, v) and (y, v): i.e., 
replacing {x, v) and {y, v) (one copy each) with (x, y). 

Without loss of generality, we can assume x* is rational. 

Now we add three new vertices to the graph: s' , t' and u. We set c(s', v) = c{s, v) and c(t', v) = c{t, v) 
for all v; c{s' , s) = c{t', t) = 0: s' and t' will be the "proxy" of s and t. We do not define the cost between 
u and other vertices: these costs do not affect the rest of the analysis. However, for notational convenience, 
we set these costs to be zero, potentially violating the triangle inequality. Let G = {V, E) be this extended 
graph. 

We extend x* into x* as well: x*(s, s') = x*{s', u) = x*{u, t') = x*{t', t) = 1, and all other newly 
added edges are set to zero. Note that the (fractional) degree of s', t' and u are 2. 

Let V' := V U {s' ,t' ,u}; we. claim that any two vertices in V' are 2-edge-connected. 

Claim 1. For anyScV such that V' n S ^ (l> and V' \ S ^ x*{d{S)) > 2. 

Proof. Without loss of generality, assume s £ S. If t ^ S, then at least one edge of the path P : 
s — s' — u — t' — tisin S{S); thus, 



minimize c{x) 

subject to x{5{S)) > 1, V5 C V, \S n {s, t}\ = 1; 

x{6{S)) > 2, ys cv,sn {s, t} = 0, s n / 0; 

Xe > 0, Ve G E. 



(12) 



x*{6{S)) > x*{5g{S n V)) + x*{6{S) n P) > 1 + 1. 



Suppose t G 5. If {s', u,t'}\S^^ then \5{S) n P\ > 2; hence. 



x*{6{S)) > x*{6{S)nP) > 2. 



Otherwise, V \ S = V \ S Hi and thus. 



x*{6{S)) > x*{6G{SnV)) > 2, 



since {Sr\V)nV' C V'. 



□ 



15 



Now scale x* by some large constant C so that z* := Cx* is integral and, in the multigraph on V 
whose edge multiplicities are given by z* , the degree of every vertex is even. Note that any two vertices in 
V' are 2C-edge-connected in this multigraph. 

Let (f) := Yjv:^V'[^*^^^'^)) ~ 2C] + Y^vev\V' ^*('^(^)); ^^en integer. We will modify z* until 

reaches 0: in particular, we split two edges in the multigraph so that 

(i) (j) decreases by 2; 

(ii) c{z*) do not increase; 

(iii) any two vertices in V' are 2C-edge-connected; 

(iv) the degrees of s' , t' and u all remain 2C; 

(v) the only edges incident to u are (s', u) and {u, t'); and 

(vi) every vertex has even degree and hence the connected component containing V' is Eulerian. 

It is clear that the invariants (Iml ) through initially hold. 

If there exists an edge that is not reachable from any vertex in V', we can remove all such edges without 
violating any of the conditions (cj) may decrease by more than 2). 

If there exists v ^ V \ V' such that z*{6{v)) > 0, then we apply Lemma[6]to pick two incident edges 
to split. Note that v ^ {s',t',u} since s',t' ,u £ V'. (Iml) is maintained from the lemma. Splitting does not 
change the degree of any vertex other than v; hence (Q), and ((vll) are satisfied. Neither of the chosen 
edges is incident to u, as can be seen from (tv]l; thus, (|v]l is maintained and ^ follows from the triangle 
inequality. 

Otherwise, we choose v e V' such that z*{6{v)) > 2C. z*{6{v)) > 2C + 2 from Again 
V ^ {s' ,t' ,u} from we can similarly verify all properties in this case as well. 

Once (j) reaches 0, we remove u and its incident edges. None of these edges got split during the process: 
this is the reason why the cost of these edges can be left undefined. 

Note that the degree of s and t now are 0, whereas s' and t' are 1. Concatenate s and s', and t and t', 
respectively; we scale this multigraph back by 1/C to obtain a feasible solution to (fTTI ) whose cost is no 
greater than c(x*). □ 

We are now ready to apply the analyses of Goemans |[T2l and Bienstock et al. 161. Let x* and y* be an 
optimal solution to (ITOl ). 

Lemma 7. Let be an approximation algorithm for the s-t path TSP that produces a path of cost at 
most p times the Held-Karp optimum. Let V-y = {v\y* > 'j} for some < 7 < 1. Running .s^^ on the 
subgraph induced by V^yL) {s, t} yields a path P with c{P) < ^c(x*). 

Proof. The proof is basically the same as [6]. Observe that ^ is a feasible solution to (fT2)) . as can be seen 
from (ITOl) and ([T2l ). From Lemma[5]and Observation |2j the Held-Karp optimum for is of cost no greater 
thanc(^). □ 

The primal-dual algorithm of Chaudhuri et al. fj\ can be used to obtain the following performance 
guarantee for the metric prize-collecting s-t path problem. 

Lemma 8 (EJISl). There exists a polynomial-time algorithm that produces an s-t path P satisfying 

c{P) + 7r(y \ V{P)) < 2c(x*) + ^(1 - y*). 
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Now, the combined algorithm is as follows: let a := p and v ■= _ j^e algorithm runs 

£/pD with probability p; otherwise, it computes an optimal solution x* and y* to (ITOl) . samples 7 uniformly 
at random from (a, 1), and run s^p on the subgraph induced by = {f |yj > 7}. 

This algorithm can be derandomized since there are only 0(|1/|) different V^'s possible. 

Theorem 6. Let s/f be an approximation algorithm for the s-t path TSP that produces a path of cost 



at most p times the Held-Karp optimum, for some | < p < 2; then, there exists a 

approximation algorithm for the metric prize-collecting s-t path problem. 

Proof. The given algorithm is a polynomial-time algorithm. Let P denote the output path. 
It can be easily verified that < a < 1 and < p < 1. From Lemma|7J 



P 



E[c(P)|i^^ is chosen] < E[-c(x* 

7 



pc{x*) 

— In a 
1 - a 



a 1-07 



pc{x*). 



is chosen] 



(13) 



We have 



E[-k{V \ V{P))\£^P is chosen] 



< 



vi^V\{s,t} 
vi=V\{s,t} 

1 — a 



V ■ mm 



,1 



(14) 



From (n^, (O, and Lemma [S 



E[c(P)+7r(y\y(P))] 



p[2c(x*)+7r(l-y*)] + (l-p) 



In a 



1 



pc{x*) + 



1 



-vr(l-y* 



, — In a 
2p+{l-p)- -p 



cxn + 



1-2 

p — e p 



I- a 
[c(x*)+7r(l-y* 



p+{l-p) 



1 



1 - a 



7r(l-y*) 



□ 



Theorem |6] along with Theorem [T] yields the following: 



Corollary 4. There exists a deterministic 1.9535-approximation algorithm for the metric prize-collecting 
s-t path problem. 



Corollary 5. The integrality gap of (1101 ) is smaller than 1.9535. 
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4.2 Unit-weight graphical metrics 

In this subsection, we study the s-t path TSP for a special case where the cost function is a shortest-path 
metric defined by an underlying undirected, unit-weight graph. 

Let X* be an optimal solution to the path-variant Held-Karp relaxation; Go be the underlying unit- 
weight graph defining the cost function. Go is connected. 

Mucha |[T9l gives an improved analysis of the 1.5858-approximation algorithm of Momke and Svens- 
son ifTSl : following is from ifTOl . 

Lemma 9 ( 11191 ). There exists an algorithm s^ofor the s-t path TSP under unit-weight graphical metrics, 
which returns a solution of cost at most 



This immediately gives a (y| + e)-approximation algorithm for any e > 0. y| < 1.5834. 

Theorem 7 (| 19|). There exists a {^ + e) -approximation algorithm for the s-t path TSP under unit-weight 
graphical metrics, for any e > 0. 

Proof. Let P be the output of From Lemma |9l 



where the last line holds since c(e) > 1 for all e. 

Thus, there exists uq such that c(P) < (y| + e)c{x*) for all input that has no or more vertices. Smaller 



It can be observed from Lemma |9] and Theorem |7] that the "critical case" determining the proven per- 
formance guarantee is when c{x*) ~ | V^|. We will show that three different constructions of Hamiltonian 
paths carry performance analyses with complementary critical cases. 

Even though r-narrow cuts function as a mere analytic tool in Section [3l we propose an algorithm that 
actually computes the r-narrow cuts and utilize them: once the r-narrow cuts are computed, the algorithm 
constructs an s-t path that traverses from the first layer to the last, without "skipping" any layer in-between. 
If the path is inexpensive, the number of r-narrow cuts is also small so the algorithm presented in Section[3] 
produces a good solution. If the path is expensive but the Held-Karp optimum is close to \ V\ — 1, then 
we prove that the path already contains a large number of vertices and therefore can be augmented into a 
spanning Eulerian path with small additional cost. Lastly, if the Held-Karp optimum is bounded away from 
\V\ — 1, then Momke & Svensson's algorithm performs well provided that the graph has large number of 
vertices. 

Algorithm [T] shows the entire algorithm (except the separate handling of small instances); 9 G (0, 1) is 
a parameter to be chosen later. Let rj : E ^ Z>o be a function such that r/(e) := c(e) — 1. For U C V, 
G{U) denotes the subgraph of G induced by U. Suppose \V\ > 3; this implies £ > 3. 





-c(x*) + -c(x*) + — 
6 ^ ^ 4 ^ ^ 12 



< 



instances can be separately solved. 



□ 
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Algorithm 1 The algorithm for the s-t path TSP under unit-weight graphical metrics 
Input: Complete graph G = {V, E) with cost function c : E ^ ^>o; endpoints s,t ^ V. 
Output: Hamiltonian path between s and t. 

1 Run let Hy^ be the output Hamiltonian path. 

2 X* -^an optimal solution to the path- variant Held-Karp relaxation 

3 Run the algorithm from Section |3l let Hb be the output Hamiltonian path. 

4 Compute the partition Li, . . . defining all the (1 — 0)-narrow cuts C/j. 

5 for 1 < i < £ do 

6 Let (pi, gfj+i) be the shortest edge in E{Li, Lj+i), where pi G Li and qi^i G L^+i. 

7 end for 

8 for 1 < i < £ do 

9 Let Pi be the shortest path from qi to pi within G(Li), under edge cost given by r]. 

10 end for 

11 Let Plt be an s-t path obtained by concatenating (s, (72), ^2, {P2, Qs), ■ ■ ■ , Pe~i, {pe-i,t)- 

12 Ge^{V,Pit) 

13 while the multigraph G^; is not spanning do 

14 Choose {u, v) such that: c(n, v) = 1, u is isolated in Ge, and u is not. 

15 Add two copies of (n, to G^;. 

16 end while 

17 Shortcut an Eulerian path of G^; to obtain a Hamiltonian path He- 

18 Let Hout be the best among Ha, Hb and He; output //out- 



Lemma 10. Algorithm\J}is a well-defined, polynomial-time algorithm. 

Proof. Steps [T3lfT6] start with an s-t path, and augment it into a spanning multigraph that has an Eulerian 
path between s and t. This follows from the preservation of the parity of degree. Choice of {u, v) satisfying 
c{u, v) = lis always possible since Go is connected. 

Plt is an s-t path since Li = {s} and Li = {t}. Note that some of Pi's may be a length-0 path. 

SteplH unlike the algorithm from Section[3l actually computes the layered structure of (1 — ^)-narrow 
cuts, whereas this structure was only for the sake of analysis in Section [3l Yet, the layers can in fact be 
identified via a polynomial number of min-cut calculations; hence, the algorithm is a polynomial-time 
algorithm. □ 

Lemma 11. 

x*{E{Li,L2))>9. 

Proof. We have 

x*{E{Li,L>3)) + x*{E{L2, L>3)) = x*{6{U2)) < 2 - (15) 

and 

x*{E{Li,L2) + x*{EiL2,L>3)) = x*{6{L2)) > 2. (16) 

From ([T5]l and (fT6l ). 

x*{E{Li,L2))-x*{E{Li,L>s))>0. 

□ 

By symmetry, x*{E{L£_i, Li)) > 6. 
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Lemma 12. For any i > 1, j < i, Vi ^ % and V2 / such that 

1. i + 2<j, 

2. ViUV2 = 'Jill^iLk, and 

3. VinV2 = ^, 

then x*{E(yi,V2)) > 9. 
Proof. We have 

x*(^(L<i,Vi)) + x*(^(L<i,y2)) + a;*(^(L<,,L>j)) = :r*(,5(L<i)) < 2 - 0; 
by symmetry, 

X* (^(L<„ L>, ) ) + X* (^(l/i , L>j ) ) + X* (^(1/2 , i>> J ) ) < 2 - ; 
x*{E{L<^,Vi))+x*{E{Vi,V2)) + x*{E{Vi,L>j))=x*{8{Vi)) > 2; 
again by symmetry, 

x*{E{L<i,V2)) + x*{E{Vi,V2))+x*{E{V2,L>j)) > 2. 
From (U} through (l20l ). 

2x*{E{Vi,V2)) - 2x*{E{L<,,L>j)) > 29. 



(17) 

(18) 
(19) 

(20) 



□ 



Corollary 6. For all 1 < i < i, x*{E{Li, Li+i)) > 9. 

Proof. From Lemma [TT] and Lemma [12] applied for j — i = 3. □ 

Corollary 7. For all i, G{Li) weighted by (the projection of) x* is 9 -edge-connected. 

Proof. Li and are singleton; every cut in any other nonsingleton layer subgraphs are of capacity at least 
9 from Lemma [T2I applied for j — i = 2. □ 

Let fj, K > be some parameters to be chosen later. 



Lemma 13. 



c(-ffout) < max < 



5 3 \ , 7 ^ 

6 + 4(rT^J^(^)+12 



3 + 29 il-9f 



> . 



+ 



2 + 9 ' 4(2 + 0) ''^^^^ 
Proof. Suppose c{x*) > (1 + — 1); from the proof of Theorem|71 

c{Hout) < c{Ha) 

< lcixn + li\v\-i) + l^ 

^ (^4(TT^)^(^*) + ^' 

thus, we can assume from now that c(x*) < (1 + — 1). 
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Case 1. 



c(Plt) > '^(IV'I - 1). 
From Corollary |6] and the choice of (pj, qi+i), 

9 ■ r]{pi,qi+i) < {r] * x*){E{Li,Li+i)). 



(21) 



(22) 



For each layer Lj with 1 < i < £, consider a bidirected flow network on G{Li) whose capacities are given 
by X*. From Corollary |7J we can route flow of 6 from qi to pi. This flow can be decomposed into cycles 
and paths from q^ to p^; thus, by the choice of Pi, 



From (122JI and (123]), 



7]{P.)<{r]*x*){E{L,)). 



l<i<i i<i<e 

< (r? * x*){EiLi, L,+i)) + (r? * x*)(i?(L,)) 
i<i<e i<i<e 

< {r]*x*){E) 

= c{x*)-x*{E) 

= c{x*)-m-i) 



(23) 



(24) 



Let G'^ be Ge after finishing the execution of Steps [T3lfT6] of Algorithm [TJ |-Plt| denotes the number 
of edges on Plt- We have 

c{Hout) < c{Hc) 

< c{G'e) 

= c(PLT) + 2[(|y|-l)-|PLT|] 

= c(PLT) + 2[(|y|-l)-{c(PLT)-??(PLT)}] 

= 2(|F|-l)-c(PLT) + 2r?(PLT) 

2a' 



< 



2- K + 



< {2-K+ 



2a 



\V\-1) 



c{x*), 



where the second last line follows from (|2T]) and (l24l) : the last from c{x*) > — 1. 
Case 2. 

c(Plt) < ^(1^1 - 1). 
Note that, from the construction of Pu, i — 1 < \Pn\', hence we have 



(25) 



1 < I^ltI < c(Plt) < Ki\V\ - 1). 
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From each (1 — 0)-narrow cut {Ui, Ui), we can pick an edge di G 5{Ui) with c{di) = 1 due to the 

connectedness of Gq. Let := ed^, a := /3 := and r = — 1 = 1 — 9. Note that this 

choice of a and f3 satisfies ([8]l. Since the second condition on of Lemma[3]is not used to derive 
^ (it is used in the later part of the proof), we have 

C(^out) < c{Hb) 

< E[c(F)] 

< (1 + a + (3)c{x*) + { max^u; [1 - {2a + + a;)}] | c flj^j 



^2 



'£-1 



3 + 20 {1 — U J I ■sr-^ 



As c(di) = 1 for all i, 



^ 3 + 20 , (1-^)^ 

^ [3 + 20 (l-^)M / *N 



□ 



Corollary 8. Lef p := max 1 1 + 4(14^' 2 — k + + 4(2+0) T"'?^''^ exists a {p + e) -approxi- 

mation algorithm for the s-t path TSP under unit-weight graphical metrics, for any e > 0. 

Corollary 9. There exists a 1.5780-approximation algorithm for the s-t path TSP under unit-weight graph- 
ical metrics. 

Proof. Directly follows from CorollarylD if we choose, for example, 9 = 1.2297 x 10^^, a = 7.2774 x 
10^3^ and K = 5.4045 x 10"\ we have p < 1.5780. □ 

Corollary 10. The integrality gap of the path-variant Held-Karp relaxation under the unit-weight graphi- 
cal metric is smaller than 1.6137. 

Proof. Trivial for \V\ = 2. Let OPT denote the optimal (integral) solution value. 

Suppose 3 < \V\ < 6. From a similar argument as in the proof of Lemma [TOl if there exists a simple 
s-t path with m edges in Go, OPT < m + 2{\V\ — 1 — m) = 2\V\ — 2 — m. Thus, if there exists a simple 
s-t path with at least two edges, 

OPT ^ ^ ^ 8 ^ ^^3^ 

c(x*) ~ 1^1 - 1 ~ 5 

Suppose there does not exist a simple s-t path with more than one edge; then {s,t) G Gq and (s,t) is 
a bridge of Gq. Let ([/, U) be the s-t cut defined by the removal of (s,t) from Gq. x*{s,t) = since 
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2x*{s,t) = x*{6{{s})) +x*{6{{t})) -x*(6{{s,t})) < 1 + 1 - 2 = 0; therefore, 



and 



c{x*) = {c*x*){6{U)) + {c*x*){E\6{U)) 

= (c * x*){6{U) \ {s, t}) + {c* x*){E \ 6{U)) 

> 2x*{d{U)\{s,t})+x*iE\6iU)) 
= x*{6{U)) +x*iE) 

> \v\ 



OPT 2\V\-3 3 



Suppose \V\ > 7. Choose 9 = 3.7304 x 10'^, a = 8.5757 x IQ-^, and k = 8.4614 x lO'^; from 
Lemma [T3l 



c{Hout) < max < 



5 3 7 

6 ^ 4(l + a) ^ 12(|y| -l)(l + f7) 

2cr~ 



c[x 



2- K + 



3 + 29 (1 

+ 



c{x*) 



2 + 9 4(2 + 9) 



cix 



< Qc{x* 



for some Q < 1.6137. 



□ 



5 Open questions 

An immediate open question is in improving the performance guarantee. The fractional T-join dominators 
constructed in the analyses are not directly derived from the algorithm; a different construction may lead 
to an improved performance guarantee. One related question is whether a and /3 can be chosen differently. 

In the proof of ^^-^^^ -approximation, Lemma |3] can be considered as distributing c(x*) over the cuts of 
different capacities. An adaptive choice of a and /3 after seeing one such distribution does not appear to 
improve the analysis; from Yao's Lemma, oblivious but stochastic choice of a and /3 does not either. 

A bigger open question is whether the techniques presented in this paper can be extended to the circuit 
case as well. Given the successful adaptation of the techniques devised in one variant to the other in the 
unit-weight graphical metric case, whether the present techniques can be extended to beat the longstanding 
3/2 barrier of the general-metric circuit problem becomes an interesting question. It appears that the 
layered structure of r-narrow cuts or the parity argument on them are less likely to directly extend to the 
circuit case, as the arguments rely on the characteristics of the path case; what could be more promising 
is the approach of repairing deficient cuts using a set of vectors obtained from an auxiliary flow network, 
since this approach might extend to work with some different type of "fragile cut structure". 
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A An LP-based new analysis of the path- variant Christofides' algorithm 

In this appendix, we present a new analysis of the path- variant Christofides' algorithm |l8][l7l for the metric 
s-t path TSP, and show how the critical case characterized by this analysis can lead to an improvement. The 
analysis compares the output solution value to the LP optimum of the path-variant Held-Karp relaxation, 
thereby proving the upper bound of 5/3 on the integrality gap of the path- variant Held-Karp relaxation. 
We note that the LP optimum is never computed by the algorithm. 

First we recall the following definition of the circuit-variant Held-Karp relaxation: 

Definition 6 ([ 16]). The circuit- variant Held-Karp relaxation is the following: 

minimize c{x) 

subject to x{5{S)) > 2, V5 C F, 5 ^ 0; 

x{5{{v})) =2, Vt; E V- ^ ^ 

X > 0. 

Let G = {V, E) be the input complete graph with cost function c : E ^ M+ and the endpoints 
s,t ^ V. The path-variant Christofides' algorithm first finds a minimum spanning tree of G; it then 
computes a minimum T-join J, where T C V is the set of the vertices with the "wrong" parity of degree 
in J^^mm■ i-C-, T is the set of odd-degree internal points and even-degree endpoints in ^min- Lastly, the 
algorithm shortcuts an Eulerian path of the multigraph U J to obtain the output Hamiltonian path H. 

We give two different bounds on the cost of J, which together will establish the performance guarantee. 
Let X* G be the LP optimum of the path- variant Held-Karp relaxation. 

Lemma 14. c(5Jnin) < c{x*). 

Proof. As can be seen from Observation [T] the path-variant Held-Karp polytope is contained in the span- 
ning tree polytope. The lemma follows from this observation, since -^mm is a minimum spanning tree. □ 

Lemmas [T5] and [T6] give the two bounds. 
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Lemma 15. c( J) < ^ {c{x*) + c(s, t)}. 

Proof. Let a;*^^.^^^^ := x* + 1): i.e., is obtained by "adding" the edge (s, t) to x*. Then x*;^.^^;^. 

is a feasible solution to the circuit-variant Held-Karp relaxation (see ([T]) and (l26l)). Let HKdrcuit be the 
optimal value of the circuit- variant Held-Karp relaxation and we have 

C{J) < ^HKcircuit 

= l{cix*) + c{s,t)}, 

where the first inequality follows from ll22ll2TI . □ 
Lemma 16. c(J) < c{x*) — c{s,t). 

Proof. Let P^f™'" be the path between s and t on ^^min- Consider an edge set J' := 5^nin \ Pgf^''^- Note 
that J is a T-join: v G V has even degree in P^^""" if and only if v is internal; thus, v has even degree in 
the multigraph ^min LI J' = {,%am U ^min) \ -Psf™'" if ^"^^ o'^ly if ^ i^ internal point, and this shows 
that V has odd degree in J' if and only if w G T. 
We have 



c(J) < c(J') 



< c{x*) - c{s,t). 

The last inequality follows from Lemma [T4l and the triangle inequality. □ 

Theorem 8. c{H) < |c(x*); therefore, the path-variant Christofides' algorithm is a 5 / 3-approximation 
algorithm, and the integrality gap of the path-variant Held-Karp relaxation is at most 5/3. 



Proof We have 



C{H) < C(5'min) + c( J) 
< c{x*) + min 

= — c(x*) + min 
3 ^ ' 



]^{c{x*) + c{s,t)} ,c{x*) - c{s,t) 



< \c{x*), (27) 

where the second inequality follows from Lemmas [141 [15] and [161 D 

We observe that the equality of (l27l ) is achieved when c(s, t) = |c(x*), and this is the critical case of 
this analysis that determines the performance guarantee proven. Hence, if we can improve the performance 
guarantee only near this critical case, such an improvement would lead to a better approximation ratio. We 
demonstrate this approach, by presenting how this analysis combines with the results of Oveis Gharan et 
al. Il20i on the unit-weight graphical metric TSP to yield a comparable result in the s-t path TSP. 
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We consider the s-t path TSP under the unit- weight graphical metric; we show how to modify the 
algorithm of Oveis Gharan et al. for the path case and that, when c(s, t) is close to ^c{x*), this modified 
algorithm carries a performance guarantee that is slightly better than 5 /3. 

First we review the results in Oveis Gharan et al. |20|. In the following, the parameters ei, e2, 7, 5 and 
p can be chosen as follows: ei = 1.875- 10"i2 e2 = 5-10-2,7 = W-'^,5 = 6.25 - IQ-^^, p = 1.5-10-^4, 
and n denotes \ V\. 

Definition 7 (Nearly integral edges). An edge e is nearly integral with respect to x £ j/xg > 1 — 7. 

Definition 8. For some constant f < and k > 2, a maximum entropy distribution over spanning 
trees with approximate marginal x G is a probability distribution fi defined by X ^ such that 
ni ''^) oc rieGT ^efor every spanning tree ^ and the marginal probability of every edge e is no greater 
than (1 + ■^)xe. 

Definition 9 (Good edges). A cut is {\+5)-near-minimum if its weight is at most {'i+5) times the minimum 
cut weight. An edge e is even with respect to F d E if every (1 + 5) -near-minimum cut containing e has 
even number of edges intersecting with F. 

For a circuit-variant Held-Karp feasible solution consider as the edge weight and let 

F be a spanning tree sampled from a maximum entropy distribution with approximate marginal (1 — 
n)'^circuif ^'^y '^^ sdge 6 is good with respect to a^*ij.cuit '/^^^ probability that e is even with respect to 
F is at least p. 

Theorem 9 (Structure Theorem). Let x*;^,^,^;^ be a feasible solution to the circuit-variant Held-Karp re- 
laxation, and let fi be a maximum entropy distribution over spanning trees with approximate marginal 
(1 — ^)x*jj.p^;^.. There exist small constants ei, e2 > such that at least one of the following is true: 

1. there exists a set E* C E such that x{E*) > ein and every edge in E* is good with respect to 

circuit' 

2. there exist at least (1 — e2)n edges that are nearly integral with respect to 

Lemma 17. Suppose that Case 1 of Theorem [9] holds and IT is sampled from fi. Let T be the set of 
odd-degree vertices in then a minimum T-join J satisfies 

E[c(J)] < c(x:i,,,iJ(^ - ^(^). 

We are now ready to present the algorithm. Algorithm [2] describes the entire algorithm for the s-t path 
TSP under the unit-weight graphical metric. It first computes the LP optimum x*. If c{s,t) is close to 
^c(x*), we run a modified version of Oveis Gharan, Saberi, and Singh's algorithm (Cases Al and A2); 
otherwise, we invoke Christofides' algorithm (Case B). Parameters u/, au and 62 are to be chosen later. 

First we show that we can have a Structure Theorem analogous to Theorem |9] by adjusting €2 and 
replacing n with (n — 1) in Case 2. The following corollary states that either there are good edges of 
significant weight with respect to x*;^.^,^;^^ or there are many nearly integral edges with respect to x* . 

Corollary 11. Let x* be a feasible solution to the path-variant Held-Karp relaxation and x*^^^^^:^^ := 
X* + 6(5^-). Let n be a maximum entropy distribution over spanning trees with approximate marginal 
(1 — ^)x*jj.(,^;^.. There exist small constants ei, £2 > such that at least one of the following is true: 
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Algorithm 2 Algorithm for the s-t path TSP under the unit-weight graphical metric 
Input: Complete graph G = {V, E) with cost function c : E ^ ^>o; endpoints s,t ^ V. 
Output: Hamiltonian path between s and t. 

1 X* -^optimal solution to the path-variant Held-Karp relaxation 

2 if c(s, t) = (| + a)c(x*) for a £ [— cr^, cr„] tlien 

3 if at least (1 — e2)(n — 1) edges are nearly integral w.r.t. x* then {Case Al} 

4 Find a minimum spanning subgraph F' containing all the nearly integral edges. 

5 Find a minimum spanning tree £^ of F'. 

6 Let T be the set of odd-degree internal points and even-degree endpoints in 3^. 

7 Compute a minimum T-join J; ^ U J. 

8 else {Case A2} 

^ ^circuit •= ^ + ^(s,t) 

10 Sample spanning tree ^ from max-entropy distribution with approx. marginal (1 — 

11 Let T be the set of odd-degree vertices in 3^. 

12 Compute a minimum T-join J; ^ U J. 

13 if {s, t) G ^0 then ^ ^ {(s, t)} else if ^ U {(s, t)} end if 

14 end if 

15 else {Case B} 

16 Find a minimum spanning tree J7 of G. 

17 Let T be the set of odd-degree internal points and even-degree endpoints in ^ . 

18 Compute a minimum T-join J; if U J. 

19 end if 

20 Shortcut an Eulerian path of the multigraph if to obtain a Hamiltonian path H; output it. 



1. there exists a set E* C E such that x{E*) > ein and every edge in E* is good with respect to 

circuit' 

2. there exist at least (1 — e'2j{n — 1) edges that are nearly integral with respect to x*. 

Proof. By Theorem|9l at least one of the two cases of Theorem |9] holds. Case 1 of Theorem|9]and Case 1 
of this corollary are identical, so consider when Case 2 of Theorem |9] holds. 
Recall that e2 was chosen as 5 • 10~^; we choose £2 = 6- 10^^. 

Suppose n < 19. x*;^.^,^;^ has at least (1 — e2)n nearly integral edges; thus, x* has at least [(1 — €2)^] — 
1 = n — 1 > (1 — eg) (n — 1) nearly integral edges. 
Suppose n > 20. x* has at least 

(l_e2)n-l = (l-e2)(n-l)-e2 

> (l-§^2)(n-l) 

> (l-e'2)(n-l) 

nearly integral edges. □ 

Lemma 18. In Case Al, c{H) < (| — Cai)c{x*) for some cai > 0. 

Proof. The following proof is adapted from |[20ll and modified for the path case. 
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Let S' be the set of nearly integral edges. Since the metric is defined by an unweighted connected 



graph, c(F') = c{S') + \F' \ S'\ < + \F' \ S'\. From 7 < i, we know that S' is a union of 

disjoint cycles and paths and the lengths of cycles are at least -. Thus, n <S"| > (ra — 1)(1 — e2)(l — 7) 
and \£r \ S'\ <{n- l)(e'2 + 7) < c(x*)(e'2 + 7). Let S = s''n^. 
We construct a fractional T-join dominator y as follows. 



We claim that y is a fractional T-join dominator. Let {U, U) be any cut that has an odd number of vertices 
in T on one side. If there exists an edge e G ^{U), then y{6{U)) > ye = 1- So suppose from 

now on that S{U) cS. Then S{U) nS = d{U) n ^. 

If U is nonseparating, U contains odd number of odd-degree vertices, and thus \6{U) H ^\ is odd. We 
have X* {S{U)) > 2 from the Held-Karp formulation and thus 



If ([/, U) is an s-t cut, then U contains even number of odd-degree vertices, and thus \S{U) n is 
even. We have {S{U) fl ^) ^ since ^ is connected and 





y{S{U)) > y{6{U) n 5) > 2 



1-7 



= 1. 



2(1-7) 



Thus y is a fractional T-join dominator. Now, 




{c*x*){S) 
2(1-7) 



for some Cai > 0. For example, we can choose cai = 4 • 10 



□ 



Lemma 19. In CaseAl, E[c(ii")] < (| - CA2)c{x*)for some Ca2 > 0. 



Proof. First we have 
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From Lemma [TtI 



We have 



E[ciJ)] < + a)cix*){} ^^^^ 



^3 ' ' '^2 A{l + 6)' 

Pr[(s,t)GLo] > Pr[(s,t)G^] 

= n-l-n\^\{sM 

> n-l-(n-2 + -)(l + 4) 

> »-l-(n-2 + i)(l + ^; 



> 1 



7 

5n 



and hence 



E[c{H)] < E[c{^)] + E[c{J)]-{l-^)c{s,t) + ^c{s,t) 

/ *^ f/-, 4 , ,4 , ,4 ,,1 eiJp 
< c(x*) (1 - _)(- + a) + (- + a)(- 



5n"3 ' '3 ''2 4(1 + 5)- 
7 , ,1 , 7,1 
-(^-5;^)(3+")+5;^(3+«) 
f _ei5p . , A ei6p 12 

= ^("H^3-3(rT5)) + "^2-4(TT5))-n(T5-'") 

< c{x*)il-CA2) 

for some Cyi2 > by choosing sufficiently small o"/, o"^ > 0. For example, we can choose ai = 7.8- 10^^^, 
au = 3.9 • 10-52 and Ca2 = 3.9 • lO'^^ □ 

Lemma 20. In Case B, c(H) < (| - CB)c{x*)for some Cb > 0. 

Proof. Suppose that c(s, t) < — ai)c{x*). From Lemmas [T4l and [T5l it follows that 

c{H) < c(=^)+c(J) 

< <x*) + ^!^c{x*) + {^-ai)c{x*) 

3-tJ 

Suppose c(s,t) > (| + au)c{x*). From Lemmas [T4l and [T6l 

c(i/) < c(^) + c(J) 

< c(x*) + |c(x*)-(^ + a„)c(x*) 

Now choose Cb '■= m.m{^, au)- □ 
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Lemmas [TSl [T9l and [20l yield the following theorem. 

Theorem 10. For some e > 0, Algorithm^is a (| — e) -approximation algorithm for the s-t path TSP 
under the unit-weight graphical metric. 

Proof. In Cases Al and B, the multigraph is the union of a spanning tree and a T-join where T is the set 
of the vertices with the wrong parity of degree. Thus, ^ has an Eulerian path between the two endpoints. 

In Case A2, is Eulerian and hence 2-edge-connected; ^ D .ifo \ {{^it)} is therefore connected 
and ^ has an Eulerian path between the two endpoints. 

By choosing e = minjCAi, Ca2, Cb], e = 3.9 • 10~^^ for example, we have E[c(i7)] < (| - t)c{x*) 
from Lemmas [T8l [19] and |20l Thus, Algorithm |2] is a (| — e)-approximation algorithm. □ 
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